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A MULTI-TIER METHOD OF CLASSIFYING SAMPLE SPECTRA FOR 
NON-INVASIVE BLOOD ANALYTE PREDICTION 



BACKGROUND OF THE INVENTION 



FIELD OF THE INVENTION 
The invention relates to non-invasive blood analyte prediction using Near IR tissue 
absorption spectra. More particularly, the invention relates to a method of 
classifying sample spectra into groups having a high degree of internal 
consistency to minimize prediction error due to spectral interferents. 

DESCRIPTION OF RELATED TECHNOLOGY 

The goal of noninvasive blood analyte measurement is to determine the 
concentration of targeted blood analytes without penetrating the skin. Near 
infrared (NIR) spectroscopy is a promising noninvasive technology that bases 
measurements on the absorbance of low energy NIR light transmitted into a 
subject. The light is focused onto a small area of the skin and propagates through 
subcutaneous tissue. The reflected or transmitted light that escapes and is 
detected by a spectrometer provides information about the contents of the tissue 
that the NIR light has penetrated and sampled. The absorption of light at each 
wavelength is determined by the structural properties and chemical composition of 
the tissue. Tissue layers, each containing a unique heterogeneous chemistry and 
particulate distribution, produce light absorption and scattering of the incident 
radiation. Chemical components such as water, protein, fat and blood analytes 
absorb light proportionally to their concentration through unique absorption 
profiles. The sample tissue spectrum contains information about the targeted 
analyte, as well as a large number of other substances that interfere with the 
measurement of the analyte. Consequently, analysis of the analyte signal requires 
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the development of a mathematical model for extraction of analyte spectral signal 
from the heavily overlapped spectral signatures of interfering substances. 
Defining a model that produces accurate compensation for numerous interferents 
may require spectral measurements at one hundred or more frequencies for a 
sizeable number of tissue samples. 

Accurate noninvasive estimation of blood analytes is also limited by the dynamic 
nature of the sample, the skin and living tissue of the patient. Chemical, structural 
and physiological variations occur produce dramatic changes in the optical 
properties of the measured tissue sample. See R. Anderson, J. Parrish. The 
optics of human skin, Journal of Investigative Dermatology, vol. 77(1), pp. 13-19 
(1981); and W. Cheong, S. Prahl, A Welch, A review of the optical properties of 
biological tissues, IEEE Journal of Quantum Electronics, vol. 26(12), pp. 2166- 
2185 (Dec. 1990); and D. Benaron, D. Ho, Imaging (NIRI) and quantitation (NIRS) 
in tissue using time-resolved spectrophotometry: the impact of statically and 
dynamically variable optical path lengths, SPIE. vol. 1888, pp.10-21 (1993); and J. 
Conway, K. Norris, C. Bodwell, A new approach for the estimation of body 
composition: infrared interactance, The American Journal of Clinical Nutrition, vol. 
40, pp. 1123-1140 (Dec. 1984); and S. Homma, T. Fukunaga, A Kagaya, Influence 
of adipose tissue thickness in near infrared spectroscopic signals in the 
measurement of human muscle, Journal of Biomedical Optics, vol. 1(4), pp. 418- 
424 (Oct. 1996); and A Profio, Light transport in tissue, Applied Optics, vol. 28(12), 
pp. 2216-2222 (June 1989); and M. Van Gemert, S. Jacques, H. Sterenborg, W. 
Sta, Skin optics, IEEE Transactions on Biomedical Engineering, vol. 36(12), pp. 
1146-1154 (Dec. 1989); and B. Wilson, S. Jacques, Optical reflectance and 
transmittance of tissues: principles and applications, IEEE Journal of Quantum 
Electronics, vol. 26(12), pp. 2186-2199. 

Overall sources of spectral variations include the following general categories: 
1. Co-variati on of spectrally interfering species. The near infrared spectral 
absorption profiles of blood analytes tend to overlap and vary 
simultaneously over brief time periods. This overlap leads to spectral 



-2- 



WO 02/23971 



PCTAJS01/26387 



interference and necessitates the measurement of absorbance at more 
independently varying wavelengths than the number of interfering species. 

2. Sample heterogeneity . The tissue measurement site has multiple layers 
and compartments of varied composition and scattering. The spectral 
absorbance versus wavelength measurement is related to a complex 
combination of the optical properties and composition of these tissue 
components. Therefore, the spectral response with changing blood analyte 
concentration is likely to deviate from a simple linear model. 

3. State Variations . Variations in the subject s physiological state effect the 
optical properties of tissue layers and compartments over a relatively short 
period of time. Such variations, for example, may be related to hydration 
levels, changes in the volume fraction of blood in the tissue, hormonal 
stimulation, skin temperature fluctuations and blood hemoglobin levels. 
Subtle variations may even be expected in response to contact with an 
optical probe. 

4. Structural Variations . The tissue characteristics of individuals differ as a 
result of factors that include hereditary, environmental influences, the aging 
process, sex and body composition. These differences are largely 
anatomical and can be described as slowly varying structural properties 
producing diverse tissue geometry. Consequently, the tissue of a given 
subject may have distinct systematic spectral absorbance features or 
patterns that can be related directly to specific characteristics such as 
dermal thickness, protein levels and percent body fat. While the 
absorbance features may be repeatable within a patient, the structural 
variations in a population of patients may not be amenable to the use of a 
single mathematical calibration model. Therefore, differences between 
patients are a significant obstacle to the noninvasive measurement of blood 
analytes through NIR spectral absorbance. 
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In a non-dispersive system, variations similar to (1) above are easily modeled 
through multivariate techniques such as multiple linear regression and factor- 
based algorithms. Significant effort has been expended to model the scattering 
properties of tissue in diffuse reflectance, although the problem outlined in (2) 
above has been largely unexplored. Variation of the type listed in (3) and (4) above 
causes significant nonlinear spectral response for which an effective solution has 
not been reported. For example, several reported methods of noninvasive glucose 
measurement develop calibration models that are specific to an individual over a 
short period of time. See K. Hazen, Glucose determination in biological matrices 
using near-infrared spectroscopy, Doctoral Dissertation, University of Iowa (Aug. 
1995); and J. Burmeister, In vitro model for human noninvasive blood glucose 
measurements, Doctoral Dissertation, University of Iowa (Dec. 1997); and M. 
Robinson, R. Eaton, D. Haaland, G. Koepp, E. Thomas, B. Stallard and P. 
Robinson, Noninvasive glucose monitoring in diabetic patients: a preliminary 
evaluation, Clin. Chem. vol. 38 (9), pp. 1618-1622 (1992). This approach avoids 
modeling the differences between patients and therefore cannot be generalized to 
more individuals. However, the calibration models have not been tested over long 
time periods during which variation of type (4) may require recalibration. v 
Furthermore, the reported methods have not been shown to be effective over a 
range of type (3) variations. 



SUMMARY OF THE INVENTION 

The invention provides a Multi-Tier method for classifying tissue absorbance 
spectra that compensates for variation in sample spectra due to co-variation of 
spectral interferents, sample heterogeneity, state variation and structural variation. 
Measurement spectra are associated with localized calibration models that are 
designed to produce the most accurate estimates for the patient at the time of 
measurement. Classification occurs through extracted features of the tissue 
absorbance spectrum related to the current patient state and structure. 
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The invention also provides a method of developing localized calibration models 
from tissue absorbance spectra from a representative population of patients or 
physiological states of individual patients that have been segregated into groups. 
The groups or classes are defined on the basis of structural and state similarity 
such that the variation in tissue characteristics within a class is smaller than the 
variation between classes. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides a representation of a Multi-Tiered Classification Tree structure, 
according to the invention; 

Figure 2 is a block diagram of the architecture of an intelligent system for the 
noninvasive measurement of blood analytes, according to the invention; 

Figure 3 is a block diagram of a pattern classification system, according to the 
invention; 

Figure 4 is a noninvasive absorbance spectrum collected using a diffuse 
reflectance NIR spectrometer; 

Figure 5 shows the spectra of repeated noninvasive measurements with no 
attempt to control tissue hydration; 

Figure 6 shows the spectra of repeated noninvasive measurements using 
ambient humidity to control hydration, according to the invention; 

Figure 7 shows a noninvasive absorbance spectrum having a pronounced fat 
band at 1710nm; 
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Figure 8 is a block schematic diagram of a general calibration system for mutually 
exclusive classes, according to the invention; 

Figure 8 is a block schematic diagram of a general calibration system for fuzzy 
class assignments, according to the invention; and 

Figure 10 is a block schematic diagram showing an example of parallel calibration 
models for fuzzy set assignments, according to the invention. 

DETAILED DESCRIPTION 
MULTI-TIERED CLASSIFICATION 

The classification of tissue samples using spectra and other electronic and 
demographic information can be approached using a wide variety of algorithms. A 
wide range of classifiers exists for separating tissue states into groups having 
high internal similarity: for example, Bayesian classifiers utilizing statistical 
distribution information; or nonparametric neural network classifiers that assume 
little a priori information. See K. Funkunaga, Intro to Statistical Pattern Recognition. 
Academic Pres, San Diego, CA (1990); and J. Hertz, A Krogh, R. Palmer, 
Introduction To The Theory Of Neural Computation. Addison-Wesley Publishing 
Co., Redwood City CA (1991). The multi-tiered classification approach selected 
here provides the opportunity to grow and expand the classification database as 
more data become available. The multi-tiered classifier is similar to a hierarchic 
classification tree, but unlike a classification tree, the decision rules can be 
defined by crisp or fuzzy functions and the classification algorithm used to define 
the decision rule can vary throughout the tree structure. 

Referring now to Figure 1, an example of a Multi-Tiered Classification scheme is 
represented. A first tier 1 1 assigns sample spectra according to pre-defined age 
groups: 18 —27 (15), 28 —40 (14), 40 —54 (13) and 55 - 80 years old (12). As 
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indicated, a sample has been assigned to the 28 - 40 age group. A second tier 16 
assigns samples to classes 18, 17 according to sex, in this case female. A third 
tier 19, groups according to stratum corneum hydration: 31 —60 (20);<30 (21) and 
>61 corneometer units (22); in this case, >61. A fourth tier 23, groups according to 
skin temperature: 88 — 90 (24); 86 — 88 (25); 84 — 86 and <84 degrees; lihis case 
84 — 86 degrees. In this way, a determination of classmembership is made within 
each tier in the multi-tiered structure. Finally, in a last tier 28, a final class 
assignment is made into one of three pre-defined groups 29, 30 and 31 according 
to relative optical thickness of the dermis. 

For economy s sake, only the branching adjacent the selected classes is 
completely shown in Figure 1, though there would be many more intermediate and 
final classification categories in a full multi-tiered classification structure. For 
example, at the fourth tier 23 of Figure, there would be ninety-six possible 
classifications for a tissue measurement spectrum; at the final tier, there would be 
two hundred eighty-eight possible classifications. The foregoing description of a 
Multi-Tier Classification structure is meant to be exemplary only. One skilled in the 
art will appreciate that an actual classification structure could have more or fewer 
tiers, and different decision rules could be utilized at each tier than have been 
utilized in the example. 



FEATURE EXTRACTION 

As previously indicated, at each tier in the classification structure, classification is 
made based on a priori knowledge of the sample, or on the basis of instrumental 
measurements made at the tissue measurement site. In the example of Figure 1, 
the first two tiers utilize a priori information about the sample: subject age and sex. 
Successive tiers utilize information gained from instrumental measurements at the 
tissue measurement site. Further classification occurs on the basis of extracted 
features from the tissue absorbance spectra themselves. 

Feature extraction is any mathematical transformation that enhances a quality or 
aspect of the sample measurement for interpretation. See R. Duda, P. Hart, 
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Pattern Classification and Scene Analysis. John Wiley and Sons, New York (1973). 
Figure 2 shows a block diagram of an intelligent measurement system for 
noninvasive blood analyte prediction, fully described in the parent application to the 
current application: S. Malin and T. Ruchti, An Intelligent System For Noninvasive 
Blood Analyte Prediction, U.S. Patent Application Ser. No. 09/359,191; July 22, 
1999, The purpose of feature extraction 41 in Figure 2 is to concisely represent the 
structural properties and physiological state of the tissue measurement site. The 
set of features is used to classify the patient and determine the calibration 
model(s) most useful for blood analyte prediction. 

The features are represented in a vector, zeSR* 7 that is determined from the 

preprocessed measurement through 

z = f{X,x) (!) 

where f: SR^SR* 7 is a mapping from the measurement space to the feature space. 
Decomposing /(•) will yield specific transformations, ffr): 9l N —>3t M i for determining 
a specific feature. The dimension, M h indicates whether the I th feature is a scalar 
or a vector and the aggregation of all features is the vector z. When a feature is -< 
represented as a vector or a pattern, it exhibits a certain structure indicative of an 
underlying physical phenomenon. 

The individual features are divided into two categories: 

1. abstract and 

2. simple. 

Abstract features do not necessarily have a specific interpretation related to the 
physical system. Specifically, the scores of a principal component analysis are 
useful features although their physical interpretation is not always known. The 
utility of the principal component analysis is related to the nature of the tissue 
absorbance spectrum. The most significant variation in the tissue spectral 
absorbance is not caused by a blood analyte but is related to the state, structure 
and composition of the measurement site. This variation is modeled by the 
primary principal components. Therefore, the leading principal components tend 
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to represent variation related to the structural properties and physiological state of 
the tissue measurement site. 



Simple features are derived from an a priori understanding of the sample and can 
be related directly to a physical phenomenon. Useful features that can be 
calculated from N1R spectral absorbance measurements include but are not 
limited to 

1. Thickness of adipose tissue. See J. Conway, K. Norris, C. Bodwell, A 
new approach for the estimation of body composition: infrared interactance, 
The American Journal of Clinical Nutrition , vol. 40, pp. 1123-1140 (Dec. 
1984) and S. Homma, T. Fukunaga, A Kagaya, Influence of adipose tissue 
thickness in near infrared spectroscopic signals in the measurement of 
human muscle, Journal of Biomedical Optics, vol. 1(4), pp. 418-424 (Oct 
1996). 

2. Tissue hydration. See K. Martin, Direct measurement of moisture in skin 
by NIR spectroscopy, J. Soc. Cosmet. Chem. . vol. 44, pp. 249-261 
(Sept/Oct. 1993). 

3. Magnitude of protein absorbance. See J. Conway, et a/., supra. 

4. Scattering properties of the tissue. See A. Profio, Light transport in tissue, 
Applied Optics, vol. 28(12), pp. 2216-2222 (June 1989) and W. Cheong, S. 
Prahl, A. Welch, A review of the optical properties of biological tissues, IEEE 
Journal of Quantum Electronics, vol. 26(12), pp. 2166-2185 (Dec. 1990); and 
R. Anderson, J. Parrish. The optics of human skin, Journal of Investigative 
Dermatology, vol. 77(1), pp. 13-19 (1981). 

5. Skin thickness. See Anderson, et a/., supra; and Van Gemmert, et a/., 
supra. 

6. Temperature related effects. See Funkunga, supra. 
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7. Age related effects. See W. Andrew, R. Behnke, T. Sato, Changes with 
advancing age in the cell population of human dermis, Gerontoloaia. vol. 10, 
pp. 1-19 (1964/65); and W. Montagna, K. Carlisle, Structural changes in 
aging human skin, The Journal of Investigative Dermatology, vol. 73, pp. 47- 
53 (1979; and 19 J. Brocklehurst, Textbook of Geriatric Medicine and 
Gerontology , pp.593-623, Churchill Livingstone, Edinburgh and London 
(1973). 



8. Spectral characteristics related to sex. See T. Ruchti, Internal Reports 
and Presentations, Instrumentation Metrics, Inc. 

9. Pathlength estimates. See R. Anderson, et a/., supra and S. Matcher, M. 
Cope, D. Delpy, Use of water absorption spectrum to quantify tissue 
chromophore concentration changes in near-infrared spectroscopy, Phys. 
Med. Biol. , vol. 38, pp. 177-196 (1993). 

10. Volume fraction of blood in tissue. See Wilson, et al. y supra, 

11. Spectral characteristics related to environmental influences. 

Spectral decomposition is employed to determine the features related to a known 
spectral absorbance pattern. Protein and fat, for example, have known 
absorbance signatures that can be used to determine their contribution to the 
tissue spectral absorbance. The measured contribution is used as a feature and 
represents the underlying variable through a single value. 

Features related to demographic information, such as age, are combinations of 
many different effects that cannot be represented by a single absorbance profile. 
Furthermore, the relationship of demographic variables and the tissue spectral 
absorbance is not deterministic. For example, dermal thickness and many other 
tissue properties are statistically related to age but also vary substantially as a 
result of hereditary and environmental influences. Therefore, factor based 
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methods are employed to build models capable of representing variation in the 
measured absorbance related to the demographic variable. The projection of a 
measured absorbance spectrum onto the model constitutes a feature that 
represents the spectral variation related to the demographic variable. The 
compilation of the abstract and simple features constitutes the A//-dimensional 
feature space. Due to redundancy of information across the set of features, 
optimum feature selection and/or data compression is applied to enhance the 
robustness of the classifier. 



CLASSIFICATION 

The goal of feature extraction is to define the salient characteristics of 
measurements that are relevant for classification. Feature extraction is performed 
at branching junctions of the multi-tiered classification tree structure. The goal of 
the classification step is to assign the calibration model(s) most appropriate for a 
particular noninvasive measurement. In this step the patient is assigned to one of 
many predefined classes for which a calibration model has been developed and 
tested. Since the applied calibration model is developed for similar tissue 
absorbance spectra, the blood analyte predictions are more accurate than those 
obtained from a universal calibration model. 

As depicted in Figure 3, pattern classification generally involves two steps: 

1. a mapping step in which a classification model 53 measures the 
similarity of the extracted features to predefined classes; and 

2. an assignment step in which a decision engine 54 assigns class 
membership. 

Within this framework, two general methods of classification are proposed. The 
first uses mutually exclusive classes and therefore assigns each measurement to 
one class. The second scheme utilizes a fuzzy classification system that allows 
class membership in more than one class simultaneously. Both methods rely on 
previously defined classes, as described below. 
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Class Definition 

The development of the classification system requires a data set of exemplar 
spectral measurements from a representative sampling of the population. Class 
definition is the assignment of the measurements in fhe exploratory data set to 
classes. After class definition, the measurements and class assignments are 
used to determine the mapping from the features to class assignments. 

Class definition is performed through either a supervised or an unsupervised 
approach. See Y. Pao, Adaptive Pattern Recognition and Neural Networks. 
Addison-Wesley Publishing Co., Reading MA (1989). In the supervised case, 
classes are defined through known differences in the jdata. The use of a priori 
information in this manner is the first step in supervised pattern recognition, which 
develops classification models when the class assignment is known. For 
example, the majority of observed spectral variation can be modeled by three 
abstract factors, which are related to several physical properties including body fat, 
tissue hydration and skin thickness. Categorizing patients on the basis of these 
three features produces eight different classes if each feature is assigned a "high" 
and "low" value. The drawback to this approach is that attention is not given to 
spectral similarity and the number of classes tends to increase exponentially with 
the number of features. 

Unsupervised methods rely solely on the spectral measurements to explore and 
develop clusters or natural groupings of the data in feature space. Such an 
analysis optimizes the within cluster homogeneity and the between cluster 
separation. Clusters formed from features with physical meaning can be 
interpreted based on the known underlying phenomenon causing variation in the 
feature space. However, cluster analysis does not utilize a priori information and 
can yield inconsistent results. 

A combination of the two approaches utilizes a priori knowledge and exploration of 
the feature space for naturally occurring spectral classes. In this approach, 
classes are first defined from the features in a supervised manner. Each set of 
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features is divided into two or more regions and classes are defined by 
combinations of the feature divisions. A cluster analysis is performed on the data 
and the results of the two approaches are compared. Systematically, the clusters 
are used to determine groups of classes that can be combined. After 
conglomeration, the number of final class definitions is significantly reduced 
according to natural divisions in the data. 

Subsequent to class definition, a classifier is designed through supervised pattern 
recognition. A model is created, based on class definitions, that transforms a 
measured set of features to an estimated classification. Since the ultimate goal of 
the classifier is to produce robust and accurate calibration models, an iterative 
approach must be followed in which class definitions are optimized to satisfy the 
specifications of the measurement system. 

Statistical Classification 

The statistical classification methods are applied to mutually exclusive classes 
whose variation can be described statistically. See J. Bezdek, S. Pal, eds, Fuzzy 
Models for Pattern Recognition. IEEE Press, Piscataway NJ (1992). Once class 
definitions have been assigned to a set of exemplary samples, the classifier is 
designed by determining an optimal mapping or transformation from the feature 
space to a class estimate which minimizes the number of misclassifications. The 
form of the mapping varies by method as does the definition of "optimal". Existing 
methods include linear Discriminant analysis, SIMCA, k nearest-neighbor and 
various forms of artificial neural networks. See Funkunaga, supra; and Hertz, et a/., 
supra] and Martin, supra; and Duda, &t a/., supra; and Pao, supra; and S. Wold, M. 
Sjostrom, SIMCA: A method for analyzing chemical data in terms of similarity and 
analogy, Chemometrics: Theory and Application, ed. B.R. Kowalski, ACS 
Symposium Series, vol. 52 (1977); and S. Haykin, Neural Networks: A 
Comprehensive Foundation. Prentice-Hall, Upper Saddle River NJ (1994). The 
result is a function or algorithm that maps the feature to a class, c, according to 

c = j(zj (2) 
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where c is an integer on the interval [1,F] and P is the number of classes. The 
class is used to select or adapt the calibration model as discussed in the 
Calibration Section. 

Fuzzy Classification 

While statistically based class definitions provide a set of classes applicable to 
blood analyte estimation, the optical properties of the tissue sample resulting in 
spectral variation change over a continuum of values. Therefore, the natural 
variation of tissue thickness, hydration levels and body fat content, among others, 
results in class overlap. Distinct class boundaries do not exist and many 
measurements are likely to fall between classes and have a statistically equal 
chance of membership in any of several classes. Therefore, "hard" class 
boundaries and mutually exclusive membership functions appear contrary to the 
nature of the target population. 

A more versatile method of class assignment is based on fuzzy set theory. See 
Bezdek, etaL, supra; and C. Chen, ed., Fuzzy Logic and Neural Network Handbook . 
IEEE Press, Piscataway NJ (1996); and L. Zadeh, Fuzzy Sets, Inform. Control, vol. 
8, pp. 338-353 (1965). Generally, membership in fuzzy sets is defined by a 
continuum of grades and a set of membership functions that map the feature 
space into the interval [0,1] for each class. The assigned membership grade 
represents the degree of class membership with "1" corresponding to the highest 
degree. Therefore, a sample can simultaneously be a member of more than one 
class. 

The mapping from feature space to a vector of class memberships is given by 

c k =f k (z) ( 2 

where k= 1,2, P, f k {.) is the membership function of the class, c*e[0,1] for all k 
and the vector ce$i p is the set of class memberships. The membership vector 
provides the degree of membership in each of the predefined classes and is 
passed to the calibration algorithm. 
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The design of membership functions utilizes fuzzy class definitions similar to the 
methods previously described. Fuzzy cluster analysis can be applied and several 
methods, differing according to structure and optimization approach can be used 
to develop the fuzzy classifier. All methods attempt to minimize the estimation error 
of the class membership over a population of samples. 

MULTI-TIERED CALIBRATION 

Blood analyte prediction occurs by the application of a calibration model to the 
preprocessed measurement as depicted in Figure 2. The proposed prediction 
system involves a calibration or a set of calibration models that are adaptable or 
selected on the basis of the classification step. 

DEVELOPMENT OF LOCALIZED CALIBRATION MODELS 

Accurate blood analyte prediction requires calibration models that are capable of 
compensating for the co-varying interferents, sample heterogeneity, state and 
structural variations encountered. Complex mixtures of chemically absorbing 
species that exhibit substantial spectral overlap between the system components 
are solvable only with the use of multivariate statistical models. However, 
prediction error increases with increasing variation in interferents that also co-vary 
with analyte concentration in calibration data. Therefore, blood analyte prediction is 
best performed on measurements exhibiting smaller interference variations that 
correlate poorly with analyte concentration in the calibration set data. Since it may 
not be possible to make all interference variations random, it is desirable to limit 
the range of spectral interferent variation in general. 

The principle behind the multi-tiered classification and calibration system is based 
on the properties of a generalized class of algorithms that are required to 
compensate for overlapped interfering signals in the presence of the desired 
analyte signal. See H. Martens, T. Naes, Multivariate Calibration. John Wiley and 
Sons, New York (1989). The models used in this application require the 
measurement of multiple independent variables, designated as x, to estimate a 
single dependent variable, designated as y. For example, y may be tissue glucose 
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concentration, and x may represent a vector, [x } x 2 x,- ], consisting of the 
noninvasive spectrum signal intensities at each of n wavelengths. 

The generalized form of a model to be used in the calculation of a single glucose 
estimate uses a weighted summation of the noninvasive spectrum as in Equation 
4. The weights, w y are referred to as the regression vector. 

y = (4) 

The weights define the calibration model and must be calculated from a given 
calibration set of noninvasive spectra in the spectral matrix X, and associated 
reference values y for each spectrum: 

W=(X T X) ! X T y. (5) 

The modeling error that might be expected in a multivariate system using Equation 
5 can be estimated using a linear additive mixture model. Linear additive mixtures 
are characterized by the definition that the sum of the pure spectra of the individual 
constituents in a mixture equals the spectra of the mixture. Linear mixture models 
are useful in assessing the general limitations of multivariate models that are 
based on linear additive systems and those, noninvasive blood analysis, for 
example, that can be expected to deviate somewhat from linear additive behavior. 

Figure 4 shows an exemplary noninvasive absorbance spectrum. A set of 
spectral measurements may be represented as a matrix X where each row 
corresponds to an individual sample spectrum and each column represents the 
signal magnitude at a single wavelength. The measurement matrix can be 
represented as a linear additive mixture model with a matrix of instrument baseline 
variations B 0 , a matrix of spectra of the pure components K, and the concentrations 
of the pure components, Y, and random measurement noise present in the 
measurement of each spectrum, E. 
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The linear additive model can be broken up further into interferents and analytes 
as an extended mixture model. 



X = B 0 + YK T + TP T + E (7) 

In equation 4, T is a matrix representing the concentration or magnitude of 
interferents in all samples, and P represents the pure spectra of the interfering 
substances or effects present. Any spectral distortion can be considered an 
interferent in this formulation. For example, the effects of variable sample 
scattering and deviations in optical sampling volume must be included as sources 
of interference in this formulation. The direct calibration for a generalized least 
squares model on analyte y is 

>i™=(J!f™jqr-jif™(r-fe) (8) 

where ™is defined as the covariance matrix of the interfering substances or 

spectral effects, 0 is defined as the measurement noise, x is the spectral 

measurement, and k 0 is the instrument baseline component present in the 
spectral measurement. 

TM =P T (liyp+<Iiug(&) (9) 

The derived mean squared error (MSE) of such a generalized least squares 
predictor is found in Martens, el aA, supm. 

MSEiyois) = trace (K T ™ l K)-' 
(10) 

Equation 7 describes the generalized limitations of least squares predictors in the 
presence of interferents. If K represents the concentrations of blood glucose, a 
basic interpretation of Equation 7 is: the mean squared error in glucose estimates 
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increases with increasing variation in interferences that also co-vary with glucose 
concentration in calibration data. Therefore, the accurate estimation of glucose is 
best performed on measurements exhibiting smaller interference variations that 
poorly correlate with glucose concentration in the calibration set data. Since it may 
not be possible to make all interference variations random with glucose, it is 
desirable to limit the range of spectral interference variation in general. 

The Multi-Tier Classification provides a method for limiting variation of spectral 
interferents by placing sample measurements into groups having a high degree of 
internal consistency. Groups are defined based on a priori knowledge of the 
sample, instrumental measurements at the tissue measurement site, and 
extracted features. With each successive tier, samples are further classified such 
that variation between spectra within a group is successively limited. Tissue 
parameters to be utilized in class definition may include: stratum corneum 
hydration, tissue temperature, and dermal thickness. 

TISSUE HYDRATION 

The stratum corneum (SC), or homy cell layer covers about 10-15 Im thickness 
of the underside of the arm. The SC is composed mainly of keratinous 
dead cells, water and some lipids. See D. Bommannan, R. Potts, R. Guy, 
Examination of the Stratum Corneum Barrier Function In Vivo by Infrared 
Spectroscopy, J. Invest. Dermatol. , vol. 95, pp 403-408 (1990). Hydration of the SC 
is known to vary over time as a function of room temperature and relative humidity. 
See J. Middleton, B. Allen, Influence of temperature and humidity on stratum 
corneum and its relation to skin chapping, J. Soc. Cosmet Chem. . vol. 24, pp. 239- 
43 (1973). Because it is the first tissue penetrated by the spectrometer incident 
beam, more photons sample the SC than any other part of the tissue sample. 
Therefore, the variation of a strong near IR absorber like water in the first layer of 
the tissue sample can act to change the wavelength and depth intensity profile of 
the photons penetrating beneath the SC layer. 
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The impact of changes in SC hydration can be observed by a simple experiment. 
In the first part of the experiment, the SC hydration is allowed to range freely with 
ambient conditions. In the second part of the experiment, variations in SC 
hydration are limited by controlling relative humidity to a high level at the skin 
surface prior to measurement. Noninvasive measurements using uncontrolled 
and controlled hydration experiments on a single individual are plotted in Figures 5 
and 6, respectively. Changes in the water band 61 at 1900nm can be used to 
assess changing surface hydration. It is apparent that the range of variation in the 
water band 61 at 1900nm is considerably narrower in Figure 6 than in Figure 5. 
Since surface hydration represents a large variable in the spectral measurement, it 
is a valuable component for use in categorizing similarity in tissue samples. 

TISSUE TEMPERATURE 

The temperature of the measured tissue volume varies from the core body 
temperature, at the deepest level of penetration, to the skin surface temperature, 
which is generally related to ambient temperature, location and the amount of 
clothing at the tissue measurement site. The spectrum of water, which comprises 
about 65% of living human tissue is the most dominant spectral component at all 
depths sampled in the 1100-2500 nm wavelength range. These two facts, along 
with the known temperature-induced shifting of the water band at 1450 nm, 
combine to substantially complicate the interpretation of information about many 
blood analytes, including glucose. It is apparent that a range of temperature states 
exist in the volume of sampled living tissue and that the range and distribution of 
states in the tissue depend on the skin surface temperature. Furthermore, the 
index of refraction of skin is known to change with temperature. Skin temperature 
may therefore be considered an important categorical variable for use in the Multi- 
Tier Classification to identify groups for the generation of calibration models and 
prediction. 

OPTICAL THICKNESS OF DERMIS 

Repeated optical sampling of the tissue is necessary to calibrate to blood 
constituents. Because blood represents but a part of human tissue, and blood 
analytes only reside in fractions of the tissue, changes in the optical sampling of 
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tissue may change the magnitude of the analyte signal for unchanging levels of 
blood analytes. This kind of a sampling effect may confound efforts at calibration 
by changing the signal strength for specific levels of analyte. 

Categorization of optical sampling depth is pursued by analyzing spectral marker 
bands of the different layers. For example, the first tissue layer under the skin is 
the subcutaneous adipose tissue, consisting mainly of fat. The strength of the fat 
absorbance band can be used to assess the relative photon flux that has 
penetrated to the subcutaneous tissue level. A more pronounced fat band means 
that a greater photon flux has reached the adipose tissue and returned to the 
detector. In Figure 7, spectra with pronounced 71 and normal 72 fat bands are 
presented. The most important use of the optical thickness is to assess the 
degree of hydration in the interior tissue sampled by the optical probe. Optical 
thickness may also be a strong function of gender and body type, therefore this 
property measurement would be useful for assessing interior hydration states 
within a single individual. 

The following sections describe the calibration system for the two types of 
classifiers, mutually exclusive and fuzzy. 

MUTUALLY EXCLUSIVE CLASSES 

In the general case, the designated classification is passed to a nonlinear model 
that provides a blood analyte prediction based on the patient classification and 
spectral measurement. This process, illustrated in Figure 8, involves the 
modification of the estimation strategy for the current subject according to the 
structural tissue properties and physiological state manifested in the absorbance 
spectrum. 

This general architecture necessitates a nonlinear calibration model 103 such as 
nonlinear partial least squares or artificial neural networks since the mapping is 
highly nonlinear. The blood analyte prediction for the preprocessed measurement 
x with classification specified by c is given by 
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y = S (c,x) (11 

) 

where is a nonlinear calibration model which maps x and c to an estimate of 
the blood analyte concentration, y . 

In the preferred realization, a different calibration is realized for each class. The 
estimated class is used to select one of p calibration models most appropriate for 
blood analyte prediction using the current measurement. Given that k is the class 
estimate for the measurement, the blood analyte prediction is 

y = & (*) , 

(12) 

where #*(♦) is the calibration model associated with the k? h class. 

The calibrations are developed from a set of exemplar absorbance spectra with 
reference blood analyte values and pre-assigned classification definitions. This 
set, denoted the calibration set , must have sufficient samples to completely 
represent the range of physiological states to be encountered in the patient 
population. The p different calibration models are developed individually from the 
measurements assigned to each of the p classes. The models are realized using 
known methods including principal component regression, partial least squares 
regression and artificial neural networks. See Hertz, et a/., supra] and Pao, supra] 
and Haykin, supra] and Martens, et a/., supra] and N. Draper, H. Smith, Applied 
Regression Analysis. 2 nd ed., John Wiley and Sons, New York (1981). The various 
models associated with each class are evaluated on the basis of an independent 
test set or cross validation and the best set of models are incorporated into the 
Multi-tier Classification. Each class of patients then has a calibration model 
specific to that class. 

FUZZY CLASS MEMBERSHIP 

When fuzzy classification is employed the calibration is passed a vector of 
memberships rather than a single estimated class. The vector, c, is utilized to 
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determine an adaptation of the calibration model suitable for blood analyte 
prediction or an optimal combination of several blood analyte predictions. In the 
general case, illustrated in Figure 9, the membership vector and the preprocessed 
absorbance spectrum are both used by a single calibration 111 for blood analyte 
prediction. The calculation is given by 

y = g(c,x) 

(13) 

where is a nonlinear mapping determined through nonlinear regression, 
nonlinear partial least squares or artificial neural networks. The mapping is 
developed from the calibration set described previously and is generally complex. 

The preferred realization, shown in Figure 10, has separate calibrations 121 for 
each class. However, each calibration is generated using all measurements in 
the calibration set by exploiting the membership vector assigned to each 
measurement. In addition, the membership vector is used to determine an 
optimal combination of the p blood analyte predictions. from all classes through 
defuzzification 122. Therefore, during calibration development, a given 
measurement of the calibration set has the opportunity to impact more than one 
calibration model. Similarly, during prediction more than one calibration model is 
used to generate the blood analyte estimate. 

Each of the p calibration models is developed using the entire set of calibration 
data. However, when the calibration model is calculated, the calibration 
measurements are weighted by their respective membership in the K h class. As a 
result, the influence of a sample on the calibration model of a particular class is a 
function of its membership in the class. 

In the linear case, weighted least squares is applied to calculate regression 
coefficients and, in the case of factor based methods, the covariance matrix. See 
Duda, et a/., supra. Given a matrix of absorbance spectra X k <E < 3i rxw and reference 
blood analyte concentrations Ye ST' where r is the number of measurement spectra 
and w is the number wavelengths, let the membership in class k of each 
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absorbance spectrum be the elements of C*e9T. Then the principal components 
are given by 

F = X k M , 

(14) 

where M is the matrix of the first n eigenvectors of P. The weighted covariance 
matrix P is determined through 

P = X k VX T k , 

(15) 

where V is a square matrix with the elements of C k on the diagonal. The 
regression matrix, 5, is determined through 

B=(F T VF) l F T Vy. 

(16) 

When an iterative method is applied, such as artificial neural networks, the 
membership is used to determine the frequency the samples are presented to the 
learning algorithm. Alternatively, an extended Kalman filter is applied with a 
covariance matrix scaled according to V. 

The purpose of defuzzification is to find an optimal combination of the p different 
blood analyte predictions, based on a measurement s membership vector that 
produces accurate blood analyte predictions. Therefore, defuzzification is a 
mapping from the vector of blood analyte predictions and the vector of class 
memberships to a single analyte prediction. The defuzzifier can be denoted as 
transformation such that 

y = <t&\yi y 2 y* — y P 3 , 

(16) 

where tf(.) is the defuzzification function, c is the class membership vector and y k is 
the blood analyte prediction of the U h calibration model. Existing methods of 
defuzzification, such as the centroid or weighted average, are applied for small 
calibration sets. However, if the number of samples is sufficient, </(•) is generated 
through a constrained nonlinear model. 
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INSTRUMENT DESCRIPTION 

The Multi-Tiered Classification and Calibration is implemented in a scanning 
spectrometer which determines the NIR absorbance spectrum of the subject 
forearm through a diffuse reflectance measurement. The instrument employs a 
quartz halogen lamp, a monochromator and InGAs detectors. The detected 
intensity from the sample is converted to a voltage through analog electronics and 
digitized through a 16-bit A/D converter. The spectrum is passed to the Intelligent 
Measuring System (IMS) for processing and results in either a glucose prediction 
or a message indicating an invalid scan. 

Although the invention is described herein with reference to the preferred 
embodiment, one skilled in the art will readily appreciate that other applications 
may be substituted for those set forth herein without departing from the spirit and 
scope of the present invention. Accordingly, the invention should only be limited by 
the claims included below. 
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CLAIMS 

What is claimed is: 

1. A multi-tier pattern classification method for estimating a level of a target 
blood analyte comprising the steps of: 

providing a measured tissue absorbance spectrum from a subject; 

initially, classifying said measured spectrum into previously defined classes 
based on a priori information pertaining to said subject; 

further classifying said measured spectrum into previously defined classes 
based on at least one instrumental measurement at a tissue measurement site at 
which at which an optical sample was taken for said tissue absorbance spectrum; 
and 

extracting at least one feature from said measured spectrum for still further 
classification. 

2. The pattern classification method of Claim 1, wherein said initial 
classification step comprises the steps of: 

in a first tier, classifying said measured spectrum into previously defined 
classes based on subject s age; and 

in a second tier, further classifying said measured spectrum into previously 
defined classes based on subject s sex. 

3. The pattern classification method of Claim 1, wherein said subsequent 
classification step further comprises the steps of; 

in a third tier, further classifying said measured spectrum into previously 
defined classes based on an estimation of stratum corneum hydration at said 
tissue measurement site; and 
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in a fourth tier, further classifying said measured spectrum into previously 
defined classes based on skin temperature at said tissue measurement site. 

4. The pattern classification method of Claim 3, wherein said stratum corneum 
hydration estimate is based on a measurement of ambient humidity at said tissue 
measurement site. 

5. The pattern classification method of Claim 1 , wherein said feature extraction 
step comprises any mathematical transformation that enhances a quality or 
aspect of sample measurement for interpretation to represent concisely structural 
properties and physiological state of a tissue measurement site, wherein a 
resulting set of features is used to classify a subject and determine a calibration 
model that is most useful for blood analyte prediction. 

6. The pattern classification method of Claim 5, wherein said features are 
represented in a vector, ze9l^ that is determined from a preprocessed 
measurement through: 

where f\ SR n -h>SR m is a mapping from a measurement space to a feature space, 
wherein decomposing /(•) yields specific transformations, f}(*):9( N — > < 3t M i for 
determining a specific feature, wherein the dimension M it indicates whether an f* 
feature is a scalar or a vector and an aggregation of all features is the vector z, and 
wherein a feature exhibits a certain structure indicative of an underlying physical 
phenomenon when said feature is represented as a vector or a pattern. 

7. The pattern classification method of Claim 6, wherein individual features are 
divided into two categories comprising: 

abstract features that do not necessarily have a specific interpretation 
related to a physical system; and 
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simple features that are derived from an a priori understanding of a sample 
and that can be related directly to a physical phenomenon. 

8. The pattern classification method of Claim 7, wherein features that can be 
calculated from NIR spectral absorbance measurements include any of: 

thickness of adipose tissue; 

hematocrit level; 

tissue hydration; 

magnitude of protein absorbance; 

scattering properties of said tissue; 

skin thickness; 

temperature related effects; 

age related effects; 

spectral characteristics; 

pathlength estimates; 

volume fraction of blood in tissue; and 

spectral characteristics related to environmental influences. 

9. The pattern classification method of Claim 1, further comprising the step of: 

employing spectral decomposition to determine features related to a known 
spectral absorbance pattern. 

1 0. The pattern classification method of Claim 1 , further comprising the step of: 

employing factor-based methods to build a model capable of representing 
variation in a measured absorbance related to a demographic variable; 

wherein projection of a measured absorption onto said model constitutes a 
feature that represents spectral variation related to said demographic variable. ' 
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11. The pattern classification method of Claim 1 t wherein said feature extraction 
step determines at least one calibration model that is most appropriate for 
measurement; 

wherein a subject is assigned to one of many predefined classes for which 
a calibration model has been developed and tested. 

12. The pattern classification method of Claim 1, further comprising the steps 
of; 

measuring the similarity of a feature to predefined classes; and 
assigning class membership. 

13: The pattern classification method of Claim 12, wherein said assigning step 
uses mutually exclusive classes and assigns each measurement to one class. 

14. The pattern classification method of Claim 12, wherein said assigning step 
uses a fuzzy classification that allows membership in more than one class 
simultaneously and provides a number between zero and one indicating a degree 
of membership in each class. 

15. The pattern classification method of Claim 1, further comprising the step of: 
assigning measurements in an exploratory data set to classes. 

16. The pattern classification method of Claim 15, further comprising the step 
of; 

using measurements and class assignments to determine a mapping from 
features to class assignments. 
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17. The pattern classification method of Claim 16, further comprising the steps 
of: 

defining classes from said features in a supervised manner, wherein each 
set of features is divided into two or more regions, and wherein classes are 
defined by combination of feature divisions; 

performing a cluster analysis on the spectral data to determine groups of 
said defined classes that can be combined, wherein the final number of class 
definitions is significantly reduced; 

designing a classifier subsequent to class definition through supervised 
pattern recognition by determining an optimal mapping or transformation from the 
feature space to a class estimate that minimizes the number of misclassifications; 
and 

creating a model based on class definitions that transforms a measured 
set of features to an estimated classification, wherein said class definitions are 
optimized to satisfy the specifications of the measurement system. 

18. The pattern classification method of Claim 17, wherein said optimal 
mapping utilizes any of linear discriminant analysis, SIMCA, k nearest-neighbor, 
and artificial neural networks. 

19. The pattern classification method of Claim 18, wherein a classification 
function maps said feature to a class c, according to 

c = J(z) 9 

where c is an integer on an interval [1, F\, where P equals the number of classes, 
and wherein said class is used to select or adapt a calibration model. 

20. The pattern classification method of Claim 19, further comprising the step 
of: 
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passing said classification to a nonlinear model that provides a blood 
analyte prediction based on said classification and spectral measurement, 
wherein said blood analyte prediction for a measurement x is given by: 

y = g(c 9 x) % 

where is a nonlinear calibration model that maps x and c to an estimate of 
blood analyte concentration, y. 



21. The pattern classification method of Claim 20, wherein a different calibration 
is realized for each class and wherein an estimate of blood analyte concentration 
for a measurement is given by: 

where is a the calibration model associated with the kf h class. 

22. The pattern classification method of Claim 17, wherein a membership 
function maps said feature space into an interval [0,1] for each class, wherein 
membership is defined by a continuum of grades, and wherein a mapping from 
feature space to a vector of class memberships is given by: 

where k = 1,2, P,f k (*) is the membership function of the kf h class, c*e[0,1) for all 
k and the vector ce9t p is the set of all class memberships. 

23. The pattern classification method of Claim 22, wherein blood analyte 
prediction occurs by application of a calibration model to a preprocessed 
measurement. 

24. The pattern classification method of Claim 23, wherein said calibration 
model comprises any of nonlinear regression, nonlinear partial least squares, and 
artificial neural networks. 
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25. The pattern classification method of Claim 24, wherein a calibration model 
is passed a vector of class memberships, where a vector, c, is used to determine 
an adaptation of said calibration model suitable for blood analyte prediction or an 
optimal combination of several blood analyte predictions. 

26. The pattern classification method of Claim 25, wherein a membership 
vector and preprocessed absorbance spectrum are both used by a single 
calibration model for blood analyte prediction where the calculation is given by: 

y = g( c,x) t 

where is a nonlinear mapping determined through any of nonlinear 

regression, nonlinear partial least squares, and artificial neural networks. 

27. The pattern classification method of Claim 26, wherein separate 
calibrations are used for each class; and wherein each calibration is generated 
using all measurements in a calibration set by exploiting a membership vector 
assigned to each measurement. 

28. The pattern classification method of Claim 27, wherein said membership 
vector is used to determine an optimal combination of p blood analyte predictions 
from all classes through defuzzification. 

29. The pattern classification method of Claim 28, wherein each of the p 
calibration models is developed using an entire calibration set 

30. The pattern classification method of Claim 29, wherein calibration 
measurements are weighted by their respective membership in a kf h class when a 
jfc"' calibration model is calculated, where weighted least squares is applied to 
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calculated regression coefficients in a linear case, and wherein a covariance 
matrix is used in a factor-based methods case. 

31 . The pattern classification method of Claim 30, wherein said defuzzification 
is a mapping form a vector of blood analyse predictions and a vector of class 
memberships to a single analyte prediction, wherein said defuzzifier can be 
denoted as a transformation such that: 

y = d(f,\yi y 2 .v 3 — y p $ , 

where </(•) is the defuzzification function, c is a class membership vector and y k is a 
blood analyte prediction of a K h calibration model. 



BNSOOCID: <WO 022397 1A2_L> 



-32- 



WO 02/23971 



PCT/US01/26387 



1/9 



\ 



3 d 



CO 



V 



CP 

O 

E 



O 
CD 

Ql 

to 

> 

"oo 
o 
> 
c 

"c 
o 



o 
oo 
I 

in 

CD 

o> 

O 



m 
i 

o 

Q> 

CD 

O 



52 



o 
i 

GO 

CD 
O 



n 



oo 



CD 

CD 

O 



o 

^ o 



o 

CO 



\ 





o 








V 




c 


o 


o 


in 


hydrati 



C\2 




Q_ 




E 




CD 


o 




en 


•E 


oo 




oo 













E 




CD 


oo 


-♦-» 


oo 




1 


c 


CO 


J* 


oo 


CO 






CO 






CO 






CD 






c 


£Z 




J* 


O 


CD 


o 










Ic 


m _q 


oup 


~o 






E 
i_ 


~o 
o 




CD 






Q 







CO 



CD 
O 



93 



BNSDOCID: <WO_ 



_0223971A2J_> 



WO 02/23971 PCT/US01/26387 



2/9 



CD 
-♦-» 

^CD 

o 

c o 

•< 

~o "to 

O LiJ 

o 
m 



<D 
CD 

a 
c: 
a 



o 



a> 
-»-» 
a 

a? 
-*— ' 

CO 

CO 



to 
CP 

3 
C 

.2 

cx 
o 

5 



a> 
o 
cr 



CD 



a 

a> "o 
> i- 

LU O 
O 
O 



c 

CD _ 

a 



.2 E 

-Q O 

O ^ 



a 

o 



CP 



co to 

CO 

a 



CD .2 

o 2 

CD 



o 

J1 

o 
o 
o 




c 

2 _ 



BNSDOCID: <WO 022397 1A2_I_> 



WO 02/23971 



3/9 



PCT/US01/26387 



Measurement 



Feature 
Extraction 



53 



54 



Classification 
(Prediction) 
Model 



Decision 
Engine 



=> Class 
Assignment 



FIG. 3 




1200 1400 1600 1800 2000 2200 2400 
wavelength, nm 



FIG. 4 



BNSDOCID: <WC^ 0223971A2J_> 



WO 02/23971 PCT/US01/26387 

4/9 




0 1200 1400 1600 1800 2000 2200 2400 

wavelength, nm 



FIG. 5 



BNSDOCID: <WO. 



022397 1A2_I_> 



WO 02/23971 PCT/US01/26387 



5/9 




1200 1400 1600 1800 2000 2200 2400 
wavelength, nm 



FIG. 6 



BNSDOCID: <WO 022397 1A2_l_> 



WO 02/23971 



6/9 



PCT/US01/26387 




1450 1500 1550 1600 1650 1700 1750 1800 1850 1900 
wavelength, nm 



FIG. 7 



BNSDOCID: <WO 0223971A2J__> 



WO 02/23971 



7/9 



PCT/USO 1/26387 



Measurement 






Prep roc 


:essing 



Patient 
Classification 




Blood Analyte 
Concentration 



FIG. 8 



BNSDOCID: <WO 022397 1A2_I_> 



WO 02/23971 



8/9 



PCT/US01/26387 



Measurement 



m 



Prep rocessing 



Fuzzy 
Patient 
Classification 




Blood Analyte 
Concentration 



FIG. 9 



WO 02/23971 



9/9 



PCT/US01/26387 ♦ 



Measurement 



m 



Preprocessing 



■121 



Fuzzy 
Patient 
Classification 



9i(x) 



92 00 



93 (x) 



9 P (x) 



Defuzzification 



122 



Blood Analyte 
Concentration 



FIG. 10 



BNSOOCID: 



022397 1A2_I_: 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 

(19) World Intellectual Property Organization ^KTflllnfcSs 
Internationa] Bureau 

(43) International Publication Date (10) International Publication Number 

28 March 2002 (28.03.2002) PCT WO 02/023971 A3 




nun iii 


mm 


mill 


mill 


iii 


mi 


mi 


lllllll 


nun 


iiiiiii 


in 



(51) International Patent Classification 7 : G06F 17/00, 
19/00, A6IB 5/00 

(21) International Application Number: PCT/US0 1/263 87 

(22) International Filing Date: 22 August 2001 (22.08.2001) 

(25) Filing Language: English 

(26) Publication Language: English 

(30) Priority Data: 

09/665,201 1 8 September 2000 (18.09.2000) US 

(71) Applicant: INSTRUMENTATION METRICS, INC. 

[US/US); 7470 West Chandler Blvd., Chandler, AZ 85226 
(US). 

(72) Inventors: MONFRE, Stephen, L.; 1289 E. Palo Blanco 
Way, Gilbert, AZ 85296 (US). BLANK, Thomas, B.; 2922 
E. Tulsa Street, Chandler, AZ 85225 (US). RUCHTI, Tim- 
othy, L.; 1501 West Sea Haze Drive, Tempe, AZ 85282 
(US). THENNADIL, Suresh; 13601 S. 44th Street #1047, 
Phoenix, AZ 85044 (US). 

(74) Agents: GLENN, Michael et aL; Glenn Patent Group, 
3475 Edison Way, Suite L., Menlo Park, CA 94025 (US). 



(81) Designated States (national): AE, AL, AM, AT, AU, AZ, 
BA, BB, BG, BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, 
DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, 
IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, 
LU, LV, MA, MD, MG, MK, MN, MW, MX, NO, NZ, PL, 
PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, 
UA, UG, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GQ, GW, ML, MR, NE, SN, TD, 
TG). 

Published: 

— with international search report 

— be/ore the expiration of the time limit for amending the 
claims and to be republished in the event of receipt of 
amendments 

(88) Date of publication of the international search report: 

23 October 2003 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



(54) Title: A MULTI-TTER METHOD OF CLASSIFYING SAMPLE SPECTRA FOR NON-INVASIVE BLOOD ANALYTE PRE- 
DICTION 



1UI 



oge 18-27 



noninvasive Spectra 



oge 28-40 



| mala" 



1 



| oge 40~54.| [age 55-80 1 
12 / 



13-' 



female 



ion>01 | | 



30 i 
hydro! ionXil 



sc 

Itydf uliun<30 



SC hydration 
31-60 





bkin ttfnp 
<84 




I ikin (etqpfl 
[_ 84-86 || 




skin lemp 
86-88 


I skin Ihujj 






20- 




£5-' 





Dermal llricknws 
A 

Calibration 
Group A 



V 



Dermal thickness 




Oftrmnl thlfiknnss 


B 




C 


Calibration 




Calibration 


Group B 




Group C 







-28 



On 

o 



3t 

(57) Abstract: A method of multi-tier classification and calibration in noninvasive blood analyte prediction minimizes prediction 
error by limiting co-varying spectral interferents. Tissue samples are categorized based on subject demographic and instrumental skin 
measurements, including in vivo near-IR spectral measurements. A multi-tier intelligent partem classification sequence organizes 
spectral data into clusters having a high degree of internal consistency in tissue properties. In each tier, categories are successively 
refined using subject demographics, spectral measurement information and other device measurements suitable for developing tis- 
sue classifications. The multi-tier classification approach to calibration utilizes multivariate statistical arguments and multi-tiered 
classification using spectral features. Variables used in the multi-tiered classification can be skin surface. hydration, skin surface 
temperature, tissue volume hydration, and an assessment of relative optical thickness of the dermis by the near-IR fat band. All 
tissue parameters are evaluated using the NIR spectrum signal along key wavelength segments. 



BNSDOCID: <WO 022397 1A3_I_> 



INTERNATIONAL SEARCH REPORT 



International A ation No 

PCT/US 0^26387 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 7 G06F17/00 G06F19/00 A61B5/00 



According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 G06F A61B 



Documentation searched other than minimum documentation to the extenl that such documents are included In the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 

EPO-Internal 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



WO 00 42907 A (INSTRUMENTATION METRICS 
INC) 27 July 2000 (2000-07-27) 
the whole document 



1-31 



US 5 725 480 A (SCHAPIRA THOMAS G 
10 March 1998 (1998-03-10) 
abstract; claims 

W0 92 22804 A (FUTREX INC) 
23 December 1992 (1992-12-23) 
page 4, line 13 -page 5, line 23 



ET AL) 



□ 



Further documents are fisted In the continuation of box C 



ID 



Patent family members ore listed in annox. 



■ Special categories of cited documents : 

"A* document defining the general state of the art which is not 
considered to be of particular relevance 

"E* earlier document but published on or after the international 
f fling date 

*L" document which may throw doubts on priority daim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O* document referring to an oral disclosure, use, exhibition or 
other means 

*P" document published prior to the international filing date but 
later than the priority date claimed 



T later document published after the International fifing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

'X' document of purlieu lai felevuiiue; (lie claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

*Y* document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more olher such docu- 
ments, such combination being obvious to a person stalled 
in the art. 

'&' document member of the same patent family 



Date of the actual completion of the international search 



12 August 2003 



Date of maifing of the international search report 



20/08/2003 



Name and mailing address of the ISA 

European Patent Office. P.B. 5818 Patent laan 2 
NL-2280HVRijswijk 
Tel (+31-70) 340-2040. Tx. 31 651 epo nl. 
Fax: (+31-70) 340-3016 



Authorized officer 



Gelebart, Y 



Form PCT/ISA/210 (second sheet) (July 1992) 
BNSDOCID: <WO 0223971 A3 J _> 



Informa »n patent family members 



International *-^*"-atlon No 

PCT/US Gx/ 26387 



Patent document 


Publication 




Patent family 




Publication 


cited in search report 


date 




lIlclilDci \oJ 




Ha to 
dealt* 


WO 0042907 A 


27-07-2000 


US 


6280381 


Bl 


28-08-2001 




AU 


754677 


B2 


21-11-2002 




AU 


2620900 


A 


07-08-2000 




BR 


0008186 


A 


06-11-2001 




CA 


2358473 


Al 


27-07-2000 




EP 


1143850 


Al 


17-10-2001 




JP 


2002535023 


T 


22-10-2002 




NZ 


513092 


A 


28-02-2003 




TW 


498156 


B 


11-08-2002 




WO 


0042907 


Al 


27-07-2000 




US 


6501982 


Bl 


31-12-2002 




US 


6405065 


Bl 


11-06-2002 




US 


6493566 


Bl 


10-12-2002 




US 


6587702 


Bl 


01-07-2003 




US 


2003060693 


Al 


27-03-2003 , 




US 


2003023148 


Al 


30-01-2003 




US 


6475800 


Bl 


05-11-2002 




US 


6456870 


Bl 


24-09-2002 




US 


6512936 


Bl 


28-01-2003 




us 


6442408 


Bl 


27-08-2002 




us 


2001021803 


Al 


13-09-2001 


US 5725480 A 


10-03-1998 


CA 


2247954 


Al 


12-09-1997 




EP 


0959765 


Al 


01-12-1999 




JP 


2000506048 


T 


23-05-2000 




WO 


9732521 


Al 


12-09-1997 


WO 9222804 A 


23-12-1992 


US 


5204532 


A 


20-04-1993 




AU 


2251292 


A 


12-01-1993 




CA 


2111868 


Al 


23-12-1992 




EP 


0590077 


Al 


06-04-1994 




JP 


6508440 


T 


22-09-1994 




MX 


9202953 


Al 


01-02-1993 




lie 
Uo 




A 


iy— ii— iyyt> 




WO 


9222804 


Al 


23-12-1992 




us 


6066847 


A 


23-05-2000 



Form PCT/iSA/210 (patent family annex) (July 1992) 
BNSDOCIO: <WO 0223971 A3 J _> 



